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RACIAL DIFFERENCES IN MEASUREMENT ERROR 
IN EDUCATIONAL ACHIEVEMENT MODELS 

ABSTRACT 

I 

fc Racial differences in the accuracy of reports of parental status 

charaHeristics are investigated using Joreskog ? s (1971a) general 

l 

framework for simultaneous covariance structure analyses of multiple 
populations. Results indicate that white's reports of these 
characteristics are significantly more reliable than are those of blacks. 
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RACIAL DIFFERENCES IN MEASUPEMENT ERROR 
IN EDUCATIONAL ACHIEVEMENT MODELS 

Reliable and theoretically meaningful measurement is a prerequisite 

>— 

for good educational research; yet it has not always assumed the 
prominence it deserves. The linear statistical model is probably the most 
commonly used analytic tool in educational research, but using the linear 
statistical model carries with it a clear, but often overlooked, assumption 
about measurement error. As Blalock (1964, p. 49) noted, one assumes 
that "there may be r errors of measurement wit i respect to the dependent 
variable Y, but that all of the independent variables have been measured 
without error." Such an assumption is obviously unrealistic for most 
social data, and has a well-known effect upon the least-squares 
estimators — they are biased (Walker ancTLev, 1953, p. 306). 

Until recently, there was little that an educational researcher^fcould 
do abdut least-squares indicators biased by measurement error. There 
were basically only three alternatives. By far the most common was to 
naively assume that the variables were measured without error, and 
wistfully hope the resulting estimates were robust. A second alternative 
was to correct correlation Coefficients for attenuation, and use the 

corrected* estimates as inputs to a regression analysis. This procedure, 

a 

however, required a priori knowledge of the reliability coefficients for 
the variables; furthermore, one had to assume the reliabilities were 
invariant from one population, subpopulation, or sample to the one at 
hand. These restrictions have severely limited the use of regression 
analyses based on correlations corrected for attenuation. 
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Yet a third alternative was to measure implied coefficients between 
latent variables for which one had multiple Manifest indicators. Siegel ? 
and Hodge (1968), for example, explicated several such models in their 
paper directed to sociologists; furthermore, they noted that correlations 
corrected for attentuation "were merely special cases of their multiple 
indicator models. The problem with this alternative, as noted by both 
Hauser and Goldberger (1971) and Long (1976), is its casual approach 
toward statistical estimation and hypothesis testing. The problem results- 
from overidentified models, which yield multiple estimates of the 
associations among latent variables. In response, some authors have * 
chosen*to ignore one or more of the identifying equations (e.g., Blalock, 
1970; Land, 1970); others have averaged the estimates from the several 
equations (e.g., Hauser, 1970). A bettei/alternative would be to obtain 
estimates of the overidentified parameters by maximum likelihood 
estimation (MLE). These procedures grew out of the work of LawteV 
(1943), but the immense computational load required for the iterative 
estimation of maximum-likelihood estimates prevented Ahexr application in 
practice. Thus, the application of more adequate statistical procedures 
languished until Joreskog (1966, 1967, 1969) discovered an efficient MLE 
computational procedure, soon to be followed by a computer program for 
confirmatory factor analyst (Joreskog, Gruvaeus, and van Thillo, 1970). 

0 

The resulting variances and covariances of the latent factors could be 
used to estimate the parameters of a structural model assumed to exist 
among, the factors, and Joreskog and Sorbom (1978) have provided a # 
computer program for both a confirmatory factor analysis measurement 
model, and ei hypothesized linear structural model among the factors. 



This program is called LISREL, an acronym for linear structural 
relationships, and possesses the potential for revolutionizing the way 
educational researchers test^ hypothesized relationships among theoretical, 
unmeasured latent variables (Kerlinger, 19/7) . 

These analytic developments have direct application to substantive 
problems in educational research. Of particular importance, models of 
educational achievement often include measures of socioeconomic 
background. Otherwise, outcomes attributed to treatments within 
•schools, for example, could be due to differences in ocioeconomic 
background. If the socioeconomic* indicators are measured with error,. 
¥ however, their least-squares estimates will be biased. And when models 
are compared across populations, or across different cohorts of the same 
population, biases can be compounded by differential kinds and 

c 

magnitudes of measurement error in the several populations. 

Several studies suggest that measurement errors in socioeconomic 
background variables differ between blacks and whites. For example, 
Bielby et al. (1977) estimated response error models for measures of 
socioeconomic attainment in black and nonblack populations. Their 
results suggested that nonblacks report socioeconomic variables with 
random errors^ but blacks report *uch variables with greater consistency 
than warranted in reality. That is, if blacks overstate, say, their 
fathers education they are also likely to overstate their mothers. Woifle 
and Lichtman (1981) did not find significant nonrandom measurement 
errors among either whites or blacks, but they did find that blacks 
reported status levels of their parents with greater errors than did 
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whites. These differences were not trivial in their impact on structural t 
parameter estimates; Patteson and Wolfle (1981) found that* least-square 
regression estimates could be biased by as much as 100 percent when 
compared to estimates adjusted for the presence of measurement error. 

These comparisons of measurement error between blacks and whites 
are deficient, however, because the authors estimated models 
independently for each grc^jp. Estimating measurement models 
independeatly for each group is deficient for at least two reasons. 
First, it precludes statistical tests of group deferences in measurement 
model parameters. Second, where differences are not found across 
gfoups, better parameter estimates are obtained by equating coefficients 
across groups, thus increasing the number of observations while using 
fewer degrees of freedom. 

A statistical model appropriate for comparing measurement model 
parameter estimates across groups was proposed by Joreskog (1971a), 
but to date has received little use in educational research. Mare and 
Mason C1980) have applied Joreskogs (1971a) analytic framework to 
^children's reports of parental characteristics involving white children in> 
the sixth, ninth and twelfth grades. Corcoran (1980) has studied sex 
differences in measurement errors for socioeconomic variables. But no 
one has used Joreskog's (1971a) framework to investigate measurement 
error differences between blacks and whites. This paper makes explicit 
statistical tests of measurement model similarities and differences between 
blacks an£ whites for variables typically found in models of educational 
achievement. 
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THE DATA < ' . 

Data for this study were drawn from the National Longitudinal 
Study -of the High School Class of 1972 (see Riccobono, et al., 1981). 
The NLS was designed to provide data on a large cohort of high school 
seniors, and to follow these students as they made the move from high 
school into their eafrly years of adulthood. The variables used in this 
analysis include respondent's reports of their father's and mother's 
educational attainment in both the base-year (1972) and first follow-up 

c 

(1973) surveys, and father's occupation measured at the same times in 
terms of Duncan's (1961) socioeconomic index as adjusted to the 1970 
census occupational classification. The analysis reported here is 
restricted to 647 black respondents who possessed complete reports for 
these six variables, and 650 white respondents selected at random from 
the nearly 8000 white* with complete data. 
STATISTICAL MODELS 

For each race, the basic measurement model can be described by 
the following six equations: 
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For each true parental status characteristic this is at least a 
congeneric measures model (Joreskog, 1971b). True scores were allowed 
to ccvary, and were not constrained to be° equal for blacks and whites, 
Covariances among response errors were initially set at zero on the* 
assumption ttfat response errors were random, but were subsequently 
allowed to covary. The statistical tests available for selecting, the t?est- 
fitting model consist of (1) estimating a model in which certain 
parameters are set to be equal, sometimes within a racial group, and 
sometimes across the two groups; and (2) estimating a less constrained 
model. The test consists of assessing the statistical significance of the 
improvement in fit going from the more constrained model to the less* 
constrained model, the more constrained model fits the data as well as 
the less constrained model (i.e., within sampling error limits), then one 
may conclude that the constraints do not seriously erode the fit of the 
model. Suppose, for example, that one model is specified such that the 
reliabilities of fathers education are constrained to be equal for whites 
and blacks; another model is . specified such that these reliabilities are 
free to vary between races. If the constrained model provides just as 
good a fit as the free model, then constraining equal reliabilities does 
not erode the fit of the model to the data, and one would conclude that 
the reliabilities of father's education were equal for whites and blacks. 
RESULTS 

The strategy of analysis we followed was to specify a series of 

T J 

measurement models reflecting alternative assumptions about the patterns 
of errors with which black and white high school seniors reported the 



7 

status characteristics of their parents. In matrix algebra notation, the 
model being tested is: 

o ° ' 

' X ■ A F + 6 * \'\ 

\ 

in which X is a 6 x 1 vector of reported parental status characteristics, 
A is a 6 x 3 matrix of coefficients, F is a 3 x 1 vector of true parental 
status characteristics, and 6 is a 6 x 1 vettor of disturbances.- 'The 
covariance matrix for the status reports implied by the moc^el is then: 

E * A * A + 0 . 

o 

r • 

The simplest possible pattern of errors is to assume that all errors 
were random. In this case, theta-delta, the variance-covariance matrix 
of disturbances, was specified to be a diagonal matrix, implying that all 
error covariancgs were zero. All free coefficients were allowed to vary 
both within and across groups. The two variancte-covariance matrices of 
observed reports for blacks and whites contain 21 nonredundant 
coefficients, and within groups there are 15 coefficients being estimated 
— six in lambda, three in phi, and six in theta-delta. The variances in 
phi were fixed at unity to provide standardized estimates. 

Table 1 presents goodness-of-fit statistics for this and several 
other models. Model A of Table 1 is the model of random errors just 
described. The likelihood ratio chi-square value of the model of random 
errors was 74.51 with 12 degrees of freedom, indicating that random 
reporting errors were very unlikely to have generated the observed 
covariance matrices. 



Since errprs were no? random, it became necessary to considef 
several forms of non randomness . One form of nonrandomrress U to 
assume in the NLS base-year survey (1972) that reporting errors of 
fathers and mother's education were correlated. This could occur, for 
example, if a respondent knew one parent s educational attainment but 
not the other s/ and guessed the unknown with reference to the known. 
Another form of nonrandomness could occur if respondents reported their 
fathers educational and occupational status with more consistency than 
warranted by the truth. Model B in Table 1 was estimated with the 
covariances between the disturbances of mother's and fathers education, 
and father s education and occupation, in the base-year allowed to be 
nonzero and estimable by the model for both whites Nw and blacks. The 
likelihood ratio chi-square was 11.02 with -S^degrees of freedom. This 

* 3 

represents a major improvement in the fit of the model, and indicates- 
that non random reporting errors $of parental status variables were 
characteristic of this sample of h|gh school seniors. 

' Of course, it was also possible for parental status variables to be 
reported with nonrandom errors in^the first follow-up survey (1973). 
Model C in Table 1 represents such a model, and is also a plausible 
alternative to a model of random errors. It is even possible that 
correlated errors existed in both surveys, and Model D reports these 

results. The fit of this model to the observed data is very good, and 

i 

represents a point of departure for considering whether reporting errors 
of parental status characteristics were equal for blacks and whites. 



Model £ -in Table 1 was estimated "with all elements (as specified in 

Model D) in lambda and theta-delta specified to be equal 'for both| blacks 

and v&fites. These constraints are\ equivalent to specifying chat the 

reliability coefficients for whites and blacks are equal. The likelihood 

ratio chi-square value for this model" was 125.73 with 20, degrees of 

■ 

fpeedom, indicating that whites and blacks do not report parental status 
characteristics with equal reliabilities. 

To this point we hav& assessed the adequacy of several 
ifieasurement models merely by examining their fit to the. observed data/ 
But we are also interested in finding, if possible, a parsimonious model. 
Jpward tMsXAnd, we examined the standard errors of the* e*ror 
covariances in Model D (not shown here), and found that all but two of 
the error covariances were not significantly different from zero. The 
significant covariances were between the respondent's errors in reporting 
father's and mother's education in the first follow-up survey for both 
blacks and whites. Model F was suggested by these observations. In 
Model F the covariances between the disturbances* of mothers and 
father's education in the follow-up survey for both blacks and yvhites 
were allowed to be free, but all other error covariances were fixed at 
zero. As one would expect, Model F does not fit as well as Model D, 
but the difference in the likelihood ratio chi-square values, which itself 
is distributed as chi-square, is not statistically significant (the 
probability of ,a chi-square value of 8.30 wit!* 3 degrees of freedom is 
.217). ^This result indicates that Model F is to be preferred over Model 
D; the two models do not differ significantly in fit, but Model ~F uses 
fewer degrees of freedom. 
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Why the disturbances of mother's and father's education were 
correlated in the first follow-up survey, but not the base-year survey, 
is a bit of a mystery. The method of data collection, however, differed 
between the base-year and follow-up surveys, and may have contributed 
to the anomaly. In the base-year the respondents were giv*n a 
questionnaire in school, and then given the option of \completing the 
questionnaire in school or taking it home and answering the questions 
with the assistance of their parents. In contrast, the follow-up survey 
was conducted primarily by mail, and may not have affoHed the 
respondents the opportunity to resolve uncertainties about their parents' 
education. In any ^ven^Mhe reporting errors of mother's and father's 
3ducation were correlatSl in the follow-up survey. 

We next considered whether the reliabilities of the observed status 
variables were equal for the two surveys. To effect this analysis, we 
respecified MMel F such that the lambda coefficients and the error 
vawjmces in theta-delt* withiyracial groups were constrained to be equal 
• for "each of the two indicators for each of the latent true statuses. For 
example, the reliability of father's education observed in the base-year 
was constrained to be equal to the reliability of fathers education 
observed in the foNow-up. No constraints were made across either latent 
status variables Or racial groups. Model G in Table 1, which reports 
this analysis, provides a more parsimonious fit than Model F. Although 
the chi-square value for Model G is larger than that of Model R, Modei G 
uses fewer decrees of freedom; the difference in chi-square vatues is not 
statistically significant. 
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The reliability coefficients in Model G (n^t shown here) suggested 
the possibility that mother's and father's educations were reported by 
th* NLS respondents with equal reliability. The fit of Model H, 
however, which specified these equslitjtes separately for both whites \nd 
blacks, is not Very good. But Model I, which specifies equal reliabilities 
for mother's and father's education for blacks, but not for whites, 
provides an excellent fit to the observed data. Model I is a parsimonious 
model that fits the data well, -and became our final measurement error 
model of pe rental status characteristics. 

Parameter and reliability estimates for Model I are shown in Table 
2. First, these estimates indicate that whites report their parents 1 
status characteristics more reliably than do blacks. Blacks are only 
about 90 percent as reliable resoondents of parental status as whites. 
Second, the estimates show that all respondents reported their parents 1 
education more accurately than they reported their father's occupation. 
Father's occupation was reported only about 80 percent as reliably as 
either father's or mother's education. Finally, blacks do not favor either 
parent in reporting their educational attainment, but wfvites report their 
father's education slightly more accurately than their mother's. 

The middte panel of Table 2 shows the correlation coefficients 
among the three parental status true scores. These correlations among 
the latent or true status characteristics are all larger in value than their 
counterparts among the observed variables. This is not unexpected, as 
the-obsetved variables contain considerable portions of ranoom (and some 
nonrandom) error. . Because blacks report these variables less reliably 
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thaw whites, the correlations among the observed variables for blacks are 
considerably more attenuated than fop whites. This has serious 
consequences for substantive conclusions to be drawn from least-squares 
estimates of parameters in models of educational achievement. In 
particular, when comparisons are made between whites and blacks, the 
* least-squares parameter estimates will var\^ % as a function of differential 
measurement error even if the true effects are the same for both groups. » 

The final panel of Table 2 presents the estimated correlations 
between errors in reports of mother's and father's schooling for whites 
and blacj^. These correlations are substantial, and suggest that the 
NLS respondents surveyed by mail questionnaire resolved their 
uncertainty about parental schooling by reconciling their parents 1 
schooling. 
CONCLUSION 

This paper has illustrated the use of simultaftequs factor analytic 
methods developed by Joreskog (1971a) in estimating measurement models 
of parental status characteristics as reported by white and black high 
school seniors. By estimating measurement parameters for both groups 
simultaneously, we have been able to %nake explicit statistical tests of 
group differences in measurement error patterns. These tests indicate 
that whites report their parents 1 status characteristics more reliably than 
blacks. These results suggest that comparisons between whites and 
blacks of models of educational achievement are suspect if they are based 
solely on least-squares estimates. Such estimates are biased by the 
presence of measurement error; because measurement error varies by 
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race; interracial comparisons' of least- squares structural parameter 
estimates are biased* 
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Table U Goodnessrof--£tt-Statis-tics-^for Measureraent-Modeas- for -Black 
and White 1972 High School Graduates 



Model 



Degrees of 
Freedom 



Prob. 



A. Random measurement errors 

B. Covariances among errors of father's 
and mother's education, and father's 
education and occupation in base- 
year survey 

C. Covariances among errors of father's 
and mother's education, and father's 
education and occupation in 
fdllowup survey 

D. Covariances among errors of father's 
and mother's education, anJ father's 
education and occupation in base- 
year and followup surveys 

E. Model D with black and white measure- 
ment error structures constrained 

to be equal 

F. Covariances among errors of father's 
and m r Mier's education in followup 
surv „ 

.G. Model F with reliabilities of father's 
education equated, mother's education 
equated, and father's, occupation 
equated 

H. Model G with reliabilities of father's 
and mother's education equated to 
each other 

I. Model Gwith reliabilities of father's 
and mother's education equated to 
each other for blacks only 



74.51 



12 



.000 



11.02 



7.25 



1 .99 



125.73 



10.29 



17.49 



32.46 



17.50 



8 



8 



20 



10 



22 



26 



24 



.200 



.510 



.738 



.000 



.415 



.736 



.178 



.826 
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table 2. 'Model I Parameter Estimates 



Characteristics Manifest True Score Error sl -Reliability 
uiaraccen sties Measure Variance Variance K 



Whi tes 



Father's 
education 

Mother's 
education 

Father's 
occupation 



Base-year 
Followup 

Base-year 
Followup 

Base-year 
Followup 



1.00 
1.00 
1.00 



.088 
.088 

.119 
.119 

.270 
.270 



.955 
.955 

.940 
.940 

.854 
.854 



.916 
.916 

.888 
.888 

.760 
.760 



Blacks 

Father's 
education 

- Mother's 
education 

Father's 
occupation 



Base-year 
Followup 

Base-year 
rollowup 

Base-year 
Followup 



1.00 
1.00 
1.00 



.175 
.175 

.175 
.175 

.390 
.390 



.910 
.910 

.910 
.910 

.780 
.780 



.839 
.839 

.839 
.839 

.667 
.667 



True Score Correlations* 







1. 


2. 


3. 


1. 


Father's Educ. 


1.00 


.635 


.738 


2. 


Mother's Educ. 


.594 


1.00 


• .495 


3. 


Father's Occup. 


.730 


.468 


1.00 



* Whites below the diagonal,- and blacks above. 



Correlations between Errors in Reports of Mother's and Father's. 
Education in First Followup 



Whi tes 
Blacks 



Correlations 
.322 
.400 
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